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Abstract —This paper considers the jointiy optimai piiot and 
data power aiiocation in singie ceii upiink massive MIMO 
systems. A ciosed form soiution for the optimai iength of the 
training intervai is derived. Using the spectrai efficiency (SE) as 
performance metric and setting a totai energy budget per co¬ 
herence intervai the power controi is formniated as optimization 
probiems for two different objective functions: the minimum SE 
among the users and the sum SE. The optimai power controi 
poiicy is found for the case of maximizing the minimum SE by 
converting it to a geometric program (GP). Since maximizing 
the sum SE is an NP-hard probiem, an efficient aigorithm is 
deveioped for finding KKT (iocai maximum) points. Simuiation 
resuits show the advantage of optimizing the power controi over 
both piiot and data power, as compared to heuristic power controi 
poiicies. 

I. Introduction 

Massive multiple-input-multiple-output (MIMO) have re¬ 
cently attracted a lot of attention m, El. The idea of massive 
MIMO is to use a large amount of antennas at the base station 
(BS) to serve multiple users at the same time and frequency 
resource block. The ability to increase both spectral efficiency 
(SE) and energy efficiency makes it one of the key candidates 
for the 5G cellular networks. The analysis of the performance 
of massive MIMO is of vast importance and has been done 
in El for uplink single cell systems and in ID for multi-cell 
systems. However the analysis is done with the assumption of 
equal power allocation among the users. Only a few papers 
has considered power control, however there has not been 
much optimization of the powers. In order to harvest all the 
benefits brought by the massive antenna arrays, power control 
among the users is necessary. This can be done by varying 
the power of different users to either increase the total system 
performance or provide services with certain fairness. 

Power control in wireless cellular network has been an 
important problem for decades, dating back to point to point 
(P2P) wireless systems. Lots of efforts have been put into 
developing efficient algorithms for maximizing the system 
performance with different objectives. Due to the interference 
from other users the power control is usually hard to solve 
optimally, in particular NP-hardness was proven in Q for the 
objective of maximizing the sum performance. Eor practical 
use a reasonable approach is to develop suboptimal algorithms 
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with affordable complexity while achieving an acceptable 
performance, as done for example in 0. 

Compared to power control in P2P systems, power control 
in massive MIMO networks is a relatively new topic. Accurate 
channel estimates are needed at the BS for carrying out coher¬ 
ent linear processing, e.g. uplink detection and downlink pre¬ 
coding. Due to the large number of antennas in massive MIMO 
the instantaneous channel knowledge, which is commonly 
assumed to be known perfectly in the power control literature, 
is hard to be obtained perfectly. Therefore one needs to take 
into account both the pilot power and payload power, and 
hence optimal power control becomes even harder in massive 
MIMO. This brings a new challenge to designing algorithms 
for optimal power control to achieve different objectives. On 
the other hand, the channel hardening in massive MIMO 
makes it possible to do power control based on the large-scale 
fading rather than small-scale fading. Several work tried to 
tackle this hard problem. In Q the authors optimize the data 
power for providing every user the same throughput in multi¬ 
cell massive MIMO. In fS) power control is done to minimize 
the uplink power consumption under target SINK constraints. 
However we are not aware of any work that jointly optimize 
the pilot and data power for massive MIMO. The questions 
we want to answer in this paper are; 

1) Is power control on the pilots needed for massive MIMO 
systems? If the answer is yes, how much can we gain 
from jointly optimizing the pilot power and payload 
power, as compared to always using full power? 

2) What intuition can be obtained from the optimal power 
control? This includes the pilot length, and how the pilot 
and payload power depend on the channel quality. 

In this paper we provide partial answers to these questions 
in the case of single cell operation with maximum ratio 
combining (MRC) at the BS. This is done by formulating and 
solving the optimization problems and comparing the results 
with simple heuristic power control policies. Two commonly 
used performance objectives, namely max-min SE and sum 
SE optimization, are investigated and efficient optimization 
algorithms are developed. 

H. System Model 

We consider uplink single cell massive MIMO systems with 
M antennas at the BS and K single-antenna users. The K 
users are assigned K orthogonal pilot sequences of length Tp 



for K < Tp < T, where T is the length of the coherence 
interval in which the channels are assumed to be unchanged. 
Denote the Rayleigh fading channels between the BS and the 
users as G € where the columns of G have the 

distributions 


Pp = C/Pk with a normalization constant C. These previous 
work can all be included in our framework by setting different 
variables to be constant. Therefore our framework of power 
control is the most general and the algorithms we develop in 
this work can be applied to all the above scenarios. 


fc = l,2,...,iT, ( 1 ) 

which is a circularly symmetric complex Gaussian random 
variable, and the variance Pk represents the large-scale fading 
including path loss and shadowing. The large-scale fading 
coefficients are assumed to be known at the BS as they 
are varying slowly (in the scale of thousands of coherence 
intervals) and can be easily estimated. The schemes proposed 
in this paper only depends on the large-scale fading which 
make it feasible to optimize the power control online. 

In each coherence interval, each user k transmits an orthog¬ 
onal pilot sequence with power Pp for channel estimation. We 
assume that minimum mean squared error (MMSE) channel 
estimation is carried out at the BS to obtain the small-scale 
coefficients. This gives an MMSE estimate of the channel 
vector from user k as 

J-^pPph , ,—- . 

where ~ GiV(0,I) accounts for the additive noise during 
the training interval. The noise has been normalized to unit 
variance and the variance is absorbed into During the 
payload transmission interval, the BS received the signal: 

K 

y ^'^gkptsk + n (3) 

k=l 

where Sk is the unit variance information symbol from user 
k and n ~ CN{Q,T) represents the noise during the payload 
transmission interval. The noise has been normalized to have 
unit variance and the variance is absorbed into the payload 
power pj). The channel estimates are used for MRC detection 
of the payload, which corresponds to multiplying the y with 
gjP to detect the symbol Sk- To make fair comparison with the 
scheme with equal power allocations, we impose the following 
constraint on the total transmit energy over a coherence 
interval: 


Tpp^p + {T- Tp)pl < Ema., k = 1, . . . , K (4) 

where Emax is the total energy budget for each user within 
one coherence interval. In previous work, p^ and p^ have been 
optimized separately or often not optimized at all in which the 
benefit of massive MIMO cannot be fully harvested. Therefore 
we consider the scenario where each user can choose freely 
how to allocate its energy budget on the pilots and payload. 
In 0 Pp and pj) are set equal for every user. The work 
m optimized the payload power to maximize the minimum 
throughput, which corresponds to fixing p^ for every user 
and optimizing over the pj). The work Eol adopted inverse 
power control for the pilot power, which corresponds to setting 


III. Achievable SE With MRC 


Since the exact ergodic capacity of the uplink multiuser 
channels with channel uncertainty is unknown, bounds on the 
achievable SE are often adopted as the performance metric in 
the massive MIMO literature. Here we develop a lower bound 
for arbitrary power control with the same methodology using 
the Jensen’s inequality as in 0. This achievable SE for user 
k is given by the following proposition. 

Proposition 1: An achievable SE for user k with power 
control on the pilot and payload power is given by 

(l-^)log2(l + SINRfe) (5) 


where the SINR of user k is 


SINRfc 


_(M - l)ptPpPlTp _ 

1 + J2f=l + TpPkPp + TpP^Pk T,j^k 

(6) 


Proof: Using a similar approach as in 0 by treating all 
additive interference as a worst-case Gaussian noise and then 
applying Jensen’s inequality we can arrive at the result. The 
detailed proof is omitted here due to lack of space. ■ 

This achievable SE is used as the user performance metric 
throughout the paper, where Tp, Pp and pj) are the variables 
to be optimized (for k = The optimization can 

be done at the BS, which can then inform the users about 
the number of pilot symbols, the amount of power to be 
spent on training, and the amount of power to be spent 
on payload data. The aim is to maximize a given utility 
function U{Ri ,..., Rk) where U{-) can be any function that 
is monotonically increasing in every argument. The utility 
function characterizes the performance and fairness that we 
provide to the users. Examples of commonly used utility 
function are the max-min fairness, sum performance, and 
proportional fairness. The general problem we are trying to 
solve is: 


maximize U (Ri,... ,Rk) 

Tp,{Pp}Apt} 

subject to TpPp +{T - Tp)p'^ < Emax^k 
P^p>0,pt>Qyk,K<Tp<T. 

IV. Optimal Training Interval 


In this section we derive the optimal length of the training 
interval in closed form. Eirst we provide the following lemma: 


Lemma 1: Eor any monotonically increasing utility func¬ 
tion, the energy constraint 0 is satisfied with equality for 
every user at the optimal solution, i.e. 

Wp + {T - Tp)p'^ = Emax, k = l,...,K (8) 

at the optimal point. 






Proof: Observe that we use orthogonal pilot sequences for 
each user and therefore the SINRs in are monotonically 
increasing in for every user k. For any power allocation in 
which some users do not use the full energy budget, they can 
each increase their pilot power to improve their own SINR 
until their energy constraint is satisfied with equality, without 
causing interference to any other users. ■ 

Then we can state the following theorem which gives the 
optimal length of training interval in closed form. 

Theorem 1: For any monotonically increasing utility func¬ 
tion U{Ri,... ,Rk), the optimal Tp equals to K. 

Proof: 

We prove this by applying that the function g{x) = 
log (^1 + is ^ strictly monotonic increasing function in 

X. The detailed proof is omitted here due to lack of space. ■ 
Using the result in Theorem [T] we can reduce the number of 
variables involved in the optimization and this enable us to 
hnd the optimal solutions for certain utility functions in the 
next section. Also from Theorem [T] we know that the optimal 
training period Tp is equal to the number of users being served, 
and is the same for every user. Therefore there is no need for 
assigning pilot sequences of different length for different users. 

V. Joint Power Control of Pilots and Payload 

In this section we focus on solving the power control prob¬ 
lem (|7]i for two different utilities, namely max-min fairness 
and the sum performance, while other utilities are left for 
future work. These are the two extreme cases; totally fair and 
ignoring fairness to achieve high total throughput. 

A. Maximize the Minimum SE 

In the max-min fairness which aim at serving every user 
in the cell with equal SE. This is corresponding to choosing 
U{Ri,..., Rk) = minfci?fc. By using Theorem [1] Q now 
becomes the following optimization problem; 

maximize min Rk 

{Pp}, {Pp} ^ 

subject to TpPp + {T - Tp)p^ < Emax,'^k (9) 

P^p>o,pt>oyk. 

Since log(l-|-a:) is an increasing function of x, we can remove 
the logarithm in the objective and use the epigraph form; 


maximize 

{Pp},{pt}. 

subject to 


A 


(M - i)ptp';plTp > 

K 

•^(1 + XI + 'rpPkPp+ 

j=i 

Tpp';lik'^li]pL),yk 

jAk 

TpPp + {T - Tp)p’^ < E^ax,yk 

Pp > > 0,VA;. 


( 10 ) 


This problem is non-convex as it is formulated here, how¬ 
ever it is a geometric program (GP). Since the objective func¬ 
tion is a monomial and the constraints are valid posynomial, 
this can be solved efficiently with any GP solvers, here we use 
the MOSEK solver ifTTl with CVX ifT^ . Alternatively, (ITOl i can 
be turned into a convex optimization problem by the change 
of variable; pi = logXi for every variable Xi (in this case 
Pp,p^,X). Then any function in the form m{x) = 
with c > 0, which is defined as a monomial, becomes an 
exponential of an affine function. Sum of monomials, defined 
as the posynomial, becomes sum of exponentials of affine 
functions. Einally taking logarithm of every objective and 
constraint function we obtained a convex problem. 


B. Maximize the Sum SE 

In this part, we aim at maximizing the sum SE by choosing 
U{Ri, ..., Rk) = J2k=i By using Theorem [T] O now 
becomes the following optimization problem; 


maximize 

{Pp}, {pt} 

subject to 


^ log(l + SINRfc) 

k 

TpPp + {T- Tp)p'l < Emax.^k 

Pp>o,pt>oyk. 


( 11 ) 


Power control to maximize sum performance is known to be 
an NP-hard problem, even under perfect channel knowledge. 
Therefore in this paper we aim at hnding a local optimal 
solution with affordable computational complexity. Here we 
adopt a successive convex optimization approach to converge 
to a Karush-Kuhn-Tucker (KKT) point of (fTTT l 1(131 . We first 
reformulate the problem using the epigraph form as 


maximize I I A^ 

{pJITpSI. {AG Y 

subject to TpPp + {T - Tp)p’^ < E^ax^k (12) 
1-f SINRfe > Afc,Vfc 
pl>0,pl>Q,^k. 

In this form it is clear that the only non-convexity lies in the 
constraints of the SINRs. To deal with these SINR constraints, 
we construct a family of functions /i(p) in each iteration i 
to approximate f{pk) = 1 + SINR^ where we denote pk — 
{Pp,p^, Xk)- This has to be done for every user k and the 
functions need to satisfy the conditions given in the following 
lemma from 113: 

Lemma 2: By constructing a family of functions satisfying 
the following conditions; 

1) /(Pfc) < fi{Pk), Vpfe in the feasible set, 

2) fiPk~^^) = MPk~^^)^ where p^^^ the solution 
from the previous iteration, 

3) V/(pY'’) = V/.(pY'^), 

and optimize the problem by replacing f{pk) with fi{pk) in 
the i-th iteration, the series of the solution will converge to an 
KKT point of the original problem. 

The hrst condition is to ensure that the solution we get is 
feasible for the original problem. The second condition ensures 



that the solution from the previous iteration is feasible for the 
current iteration. As a result the objective value of the original 
problem increases in every iteration since the solution from 
the previous iteration is a feasible point to the problem in 
the current iteration. The second and third conditions together 
guarantee that the KKT conditions for the original problem are 
satisfied at convergence. As the objective value is bounded 
from above and monotonically increasing in every iteration, 
convergence is guaranteed. 

To construct the family of function f(x) we need the 
following lemma from 0: 

Lemma 3: For any posynomial g{x) = mi{x), it holds 
for any that 

5(a:) > 5(3=) = n 

The SINR constraints in (fT2li are in the form h{x)/g(x) < 1, 
which is not a valid posynomial constraint. We apply Lemma 
|3] on the denominator to replace g{x) with g(x) make it 
a valid posynomial constraint. Moreover with ai chosen as 
cti = mi{x[))/g{xQ), the three conditions in Lemma |2] are 
satisfied. Doing this for every SINR constraint we get a convex 
approximation of problem dnii. The same procedures are 
repeated until convergence. To conclude, we obtain a KKT 
point to (fTSl i with the procedure described in Algorithm [T] 

Algorithm 1 Successive convex optimization for problem (fl^ 

1: choose as the solution of max-min problem satisfying 
the constraints and initialize i = \ 

2 : repeat 

3: form the z-th approximated problem of (fT^ by ap¬ 

proximating every SINR constraints using lemma [3l 
4: solve the z-th approximated problem to obtain for 

every user k, 

5: z ^ z -I- 1 

6: until convergence 
7: return all p^*^ 


VI. Simulation Results 

In this section we present simulation results to demonstrate 
the benefits of our algorithms and compare the performance 
with the case of no power control as well as the case of power 
control on the payload power only. There are 5 schemes we 
are comparing here: 1) the solution to problem (fTOl) (marked 
as ‘max-min’ in the figures), 2) Algorithm [T] for problem (ITSli 
(marked as ‘sum’ in the figures), 3) equal power allocation 
Pu = Fp = Emax/T (marked as ‘no control’ in the figures), 
4) optimizing only payload power for problem (fTOl) by fixing 
Pp = Emax/T (marked as ‘max-min (data only)’ in the 
figures), 5) optimizing payload power only for problem (fl^ 
using Algorithm [T] by fixing = Emax/T (marked as ‘sum 
(data only)’ in the figures). We consider a scenario with 
M = 100 antennas, K = 10 users, and the length of the 
coherence interval is T = 200 (which for example corresponds 


to a coherence bandwidth of 200 kHz and a coherence time of 
1 ms). The users are assumed to be uniformly and randomly 
distributed in a cell with radius R = 500 m and no user is 
closer to the BS than 100 m. The path-loss model is chosen 
as Pk = where is the distance of user k from the 

BS. The constant Emax = 0.1 x xT to get a signal-to- 
noise ratio (SNR) of —10 dB at the cell edge when using 
equal power allocation. Hence the users that are closer to 
the BS have higher SNR. The algorithms are run for 1000 
Monte-Carlo simulations where in each snapshot the users are 
dropped randomly in the cell so that the large-scale fading Pk 
changes. 

First we consider the cumulative distribution (CDF) of the 
sum SE. In Figure [T] we plot the CDF of the sum SE for 
the scenario we described. We observe the optimized power 
control increases the sum SE significantly. The whole CDE 
is shifted to the right by almost 8 bit/s/Hz with the proposed 
power control on both the pilot and data as compared to equal 
power allocation. Eor example at the 0.95-likely point, power 
control on the data only increases the sum SE by around 
45%, power control over both pilot and data contributes to 
another 20% increases as compared to equal power allocation. 
Surprisingly even the max-min formulation, which is designed 
for providing fairness, increases the sum SE by around 35% 
comparing to equal power allocation. 

In Eigure |2] we plot the CDE of the minimum SE over 
different snapshots of user locations. We observe that without 
any power control in half of the cases the user with the lowest 
SNR will get less than 0.5 bit/s/Hz. This is not acceptable if 
we want to provide decent quality of service to every user 
being served. With max-min power control for both pilot and 
data we resolve this problem by guaranteeing every user a 
SE of more than 2 bit/s/Hz. Moreover the 0.95-likely point 
is increased by 9 times with power control on the data only, 
and 10 times with power control on both pilot and data with 
respect to equal power allocation. The performance of the sum 
SE formulation is rather surprising. By optimizing the pilot 
and data power for the sum SE also increases the minimum 
SE by 6 times in the 0.95-likely point and guarantee each user 
to have more than 1 bit/s/Hz. 

Einally in Eigure we plot the CDE of the per user SE 
over different snapshots of user locations. We observe that 
without power control the SE of the users varies from almost 
0 bit/s/Hz, which is not acceptable, to 10 bit/s/Hz which is 
probably wasted in practice due to the limited modulation size. 
With sum SE power control the problem is less serious as the 
SE only ranges from 1 b/s/Hz to 3.8 b/s/Hz. With the max- 
min power control we provide almost the same SINR in every 
snapshot, since the SE of each user is very concentrated at its 
median point of 2.2 bit/s/Hz. 

In all the figures we observe that our joint power control 
over both pilots and data behaves similarly to power control 
over only the data. Nevertheless, our joint optimization in¬ 
creases the benefits even further. The extra gain can be up to 
30% if we look at the max-min formulation. The gain of the 
sum SE is smaller, but can still be up to 20%. This shows if 








Fig. I. CDF of the sum SE with M = 100, K = 10, T = 200, R = 500m 
and cell edge SNR of —10 dB. 

we need to optimize the power allocation, power control over 
both pilots and data is the right approach. 

Some additional insights on how the optimal power alloca¬ 
tion is can be obtained from the solutions. For the max-min SE 
formulation, users that are closer to the BS spend more power 
on the pilots and users that are further away spend more power 
on the payload. This is due to the fairness property of the 
max-min formulation, which reduces the interference caused 
from the users that are closer to the BS in order to reduce the 
near-far effect. For the sum SE formulation, both users that 
are very close and further than certain distance threshold dtt 
spend more power on pilots. This is because we require a more 
accurate channel estimate from the users that are very far and 
reduce the interference from the users that are very close to 
the others. This threshold depends on all Pk and Emax- More 
in depth characterization of dth is left for future work. 

Vll. Conclusion and Future Work 

We considered the optimal joint pilot and data power allo¬ 
cation problems in single cell uplink massive MIMO systems 
with MRC detection. A closed form solution for the optimal 
length of training interval was hrst derived. Using the SE 
as performance metric and setting a total energy budget, the 
power control was formulated as optimization problems for 
two different objective functions; the minimum SE and the 
sum SE. The optimal power control policy was found for the 
case of maximizing the minimum SE by converting it to a 
GP. Since maximizing the sum SE is an NP-hard problem, 
an efficient suboptimal algorithm was developed for hnding 
KKT (local maximum) points. Simulation results showed the 
advantage of joint optimization over both pilot and data power. 
The gain over power control on data only can be up to 30% 
for the minimum SE and 20% for the sum SE. Future work 
includes extension to the multi-cell systems and taking into 
account other detection methods. 



Fig. 2. CDF of the minimum SE with M = 100, K = 10 and T = 200, 
R = 500m and cell edge SNR of —10 dB. 



Fig. 3. CDF of the per user SE with M = 100, K = 10 and T = 200, 
R = 500m and cell edge SNR of -10 dB. 
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